# Multi-dataset fine-tuning
Whisper Large V3 Vaani Hindi
Apache-2.0
A Hindi speech recognition model fine-tuned based on OpenAI's Whisper-Large-V3, trained on approximately 718 hours of transcribed Hindi speech data
Speech Recognition
Safetensors
W
ARTPARK-IISc
15.55k
3
Vi Whisper Large V3 Turbo V1
Whisper-V3-Turbo model optimized for Vietnamese automatic speech recognition (ASR) tasks, fine-tuned using multiple Vietnamese datasets
Speech Recognition
Transformers Other

V
suzii
182
7
XLSR WithLM Malayalam
Apache-2.0
This model is a fine-tuned version of facebook/wav2vec2-xls-r-300m on the IMaSC, Indic TTS Malayalam, and OpenSLR Malayalam training datasets, supporting automatic speech recognition for Malayalam.
Speech Recognition
Transformers

X
kavyamanohar
19
4
Xttsv2 Hi Ft
A Hindi text-to-speech model fine-tuned from a forked version of Coqui TTS, supporting Hindi and English speech synthesis with Hindi accent
Speech Synthesis
Transformers Supports Multiple Languages

X
AOLCDROM
26
6
Vit Facial Expression Recognition
A ViT-based facial expression recognition model fine-tuned on FER2013, MMI, and AffectNet datasets, capable of recognizing seven basic emotions
Face-related
Transformers

V
motheecreator
4,221
13
Vit Facial Expression Recognition
ViT-based facial expression recognition model, fine-tuned on FER2013, MMI, and AffectNet datasets, supporting seven emotion classifications
Face-related
Transformers

V
mo-thecreator
8,730
16
Kobart Summary V3
Korean text summarization model fine-tuned based on kobart, generating summaries with more short sentences
Text Generation
Transformers Korean

K
EbanLee
5,139
17
Matter 0.1 7B GGUF
Apache-2.0
Matter 7B is a fine-tuned model based on Mistral 7B, designed for text generation tasks, supporting conversational interaction and function calling.
Large Language Model English
M
munish0838
127
1
Marian Finetuned Multidataset Kin To En
Apache-2.0
This model is a fine-tuned machine translation model for Kinyarwanda to English, based on Helsinki-NLP/opus-mt-rw-en
Machine Translation
Transformers

M
RogerB
64
0
Wav2vec2 Large Xlsr 53 Japanese
Apache-2.0
Japanese speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, supporting 16kHz sampled audio input
Speech Recognition
Transformers Japanese

W
Ivydata
19
4
Whisper Small Khmer V2
Apache-2.0
A Khmer speech recognition model fine-tuned based on OpenAI Whisper-small, trained on OpenSLR, Google FLEURS, and km-speech-corpus datasets
Speech Recognition Other
W
seanghay
35
3
Flan T5 3b Summarizer
Bsd-3-clause
A general-purpose summarizer based on the 3B-parameter google/flan-t5-xl model fine-tuned on multiple summarization datasets, suitable for academic and general scenarios.
Text Generation
Transformers English

F
jordiclive
231
36
Whisper Telugu Medium
Apache-2.0
Telugu speech recognition model fine-tuned based on OpenAI Whisper-medium, trained on multiple public Telugu ASR datasets
Speech Recognition Other
W
vasista22
228
2
Whisper Large V2 Mix Jp
Apache-2.0
An automatic speech recognition (ASR) model fine-tuned on Japanese speech datasets based on OpenAI Whisper-large-v2
Speech Recognition
Transformers

W
vumichien
93
9
Whisper Medium Da
Apache-2.0
A Danish automatic speech recognition (ASR) model fine-tuned based on OpenAI Whisper Medium, trained on Common Voice 11 and FLEURS datasets
Speech Recognition
Transformers Other

W
jstoone
22
5
Whisper Th Medium Combined
Apache-2.0
Fine-tuned on an enhanced Thai dataset based on openai/whisper-medium for Thai automatic speech recognition
Speech Recognition
Transformers

W
biodatlab
4,167
17
Whisper Large V2 Pl V2
An automatic speech recognition model fine-tuned on Polish datasets based on Whisper Large v2, supporting Polish speech-to-text tasks.
Speech Recognition
Transformers Other

W
bardsai
217
6
Whisper Medium Id
Apache-2.0
A speech recognition model fine-tuned on Indonesian datasets based on openai/whisper-medium, significantly improving the accuracy of Indonesian recognition.
Speech Recognition
Transformers Other

W
cahya
1,961
21
Legal BERTimbau Sts Large Ma V3
Portuguese legal domain sentence similarity model based on BERTimbau large model, supporting 1024-dimensional vector representation
Text Embedding
Transformers Other

L
rufimelo
407
3
Ptt5 Base Summ Xlsum
MIT
A Brazilian Portuguese abstractive text summarization model fine-tuned on PTT5, supporting summarization for various text types including news.
Text Generation
Transformers Other

P
recogna-nlp
3,754
16
Deberta Base Combined Squad1 Aqa And Newsqa
MIT
A Q&A model based on DeBERTa-base architecture, jointly fine-tuned on SQuAD1, AQA, and NewsQA datasets
Question Answering System
Transformers

D
stevemobs
15
0
Wav2vec2 Xls R 1b Italian Doc4lm 5gram
Apache-2.0
Italian speech recognition model fine-tuned from XLS-R 1B parameter model, supports recognition with language model
Speech Recognition
Transformers Other

W
radiogroup-crits
19
1
Wav2vec2 Large Xlsr 53 Finnish
Apache-2.0
A Finnish automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, supporting 16kHz sampled audio input.
Speech Recognition Other
W
Tommi
28
0
Wav2vec2 Large Xlsr 53 Greek
Apache-2.0
This is a Greek automatic speech recognition model based on the XLSR-Wav2Vec2 architecture, developed by the Hellenic Military Academy and the Technical University of Crete.
Speech Recognition Other
W
lighteternal
443
8
Mbart Large Cc25 Cnn Dailymail Xsum Nl
A Dutch news summarization model fine-tuned on mbart-large-cc25, supporting CNN/DailyMail and XSum format summarization tasks
Text Generation
Transformers Other

M
ml6team
129
4
Wav2vec2 Large Xlsr Hindi
Apache-2.0
Hindi speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, supporting 16kHz sampled audio input
Speech Recognition
Transformers Other

W
skylord
82
2
Rut5 Base Absum
MIT
This is a Russian abstractive summarization model based on the T5 architecture and fine-tuned on multiple datasets, capable of generating concise and accurate text summaries.
Text Generation
Transformers Other

R
cointegrated
1,135
27
Wav2vec2 Base Vn 270h
A speech recognition model fine-tuned with approximately 270 hours of Vietnamese annotated data, supporting Vietnamese automatic speech recognition tasks
Speech Recognition Other
W
dragonSwing
202
8
Wav2vec2 Large Xlsr 53 Finnish
Apache-2.0
A Finnish automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, supporting 16kHz sampled audio input
Speech Recognition
Transformers Other

W
vasilis
27
0
Wav2vec2 Large Xlsr Malayalam
Apache-2.0
A Malayalam fine-tuned speech recognition model based on facebook/wav2vec2-large-xlsr-53, supporting 16kHz sampled audio input.
Speech Recognition Other
W
gvs
29.57k
5
Camembert Base Squadfr Fquad Piaf
A French Q&A model based on CamemBERT, fine-tuned on three French Q&A datasets: PIAF, FQuAD, and SQuAD-FR
Question Answering System
Transformers French

C
AgentPublic
1,789
28
Wav2vec2 Large Xlsr 53 Finnish
Apache-2.0
This is an automatic speech recognition model fine-tuned on Finnish based on facebook/wav2vec2-large-xlsr-53, but has been marked as an old model, and it is recommended to use newer alternatives.
Speech Recognition Other
W
aapot
33
0
Dpr Question Encoder Fr Qa Camembert
A French DPR model based on CamemBERT, optimized for French Q&A tasks, fine-tuned on PIAF, FQuAD, and SQuAD-FR datasets
Question Answering System
Transformers French

D
AgentPublic
229
8
Xlsr 53 Wav2vec Greek
Apache-2.0
This is a Greek fine-tuned speech recognition model based on facebook/wav2vec2-large-xlsr-53, using the Common Voice and CSS10 Greek datasets.
Speech Recognition
Transformers Other

X
harshit345
19
1
Wav2vec2 Large 100k Voxpopuli Catala
Apache-2.0
A Catalan speech recognition model fine-tuned based on the VoxPopuli large model, trained on Common Voice and ParlamentParla datasets
Speech Recognition Other
W
softcatala
16
0
Wav2vec2 Large Xlsr Vietnamese
Apache-2.0
This is a Vietnamese fine-tuned speech recognition model based on facebook/wav2vec2-large-xlsr-53, trained using the Common Voice and Infore_25h datasets.
Speech Recognition Other
W
CuongLD
37
1
Wav2vec2 Large Xlsr 53 Greek
Apache-2.0
This is a Greek fine-tuned speech recognition model based on facebook/wav2vec2-large-xlsr-53, trained using the Common Voice and CSS10 datasets.
Speech Recognition Other
W
PereLluis13
21
0
Wav2vec2 Xls R 1b Ca Lm
Apache-2.0
This is a Catalan speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m, trained on multiple Catalan datasets.
Speech Recognition
Transformers Other

W
PereLluis13
3,758
4
Wav2vec2 Large 100k Voxpopuli Ft Common Voice Plus TTS Dataset Plus Data Augmentation Russian
Apache-2.0
A Russian speech recognition model fine-tuned on Facebook's Wav2vec2 Large 100k Voxpopuli model using Common Voice 7.0, M-AILABS datasets, and data augmentation techniques.
Speech Recognition
Transformers Other

W
Edresson
23
2
Wav2vec2 Base Voxpopuli Sv Swedish
A Swedish speech recognition model fine-tuned using NST and Common Voice data, based on Facebook's VoxPopuli-sv base model.
Speech Recognition
Transformers

W
KBLab
38
0
Featured Recommended AI Models